Trying to improve phone and word recognition using finely tuned phone-like units
Authors
Abstract
Phone-like units (PLUs) for automatic speech recognition are derived using a decision tree algorithm. In our approach we use information such as target phone label, immediate context, lexical stress level and function word affiliation in the decision tree analysis. The resulting PLUs are shown to improve phone and word recognition.
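As a rough illustration of the idea in the abstract, the sketch below grows a small decision tree over phone tokens and treats each leaf as one phone-like unit. The abstract names the information used (target phone label, immediate context, lexical stress level, function word affiliation) but not the splitting criterion, so everything else here is an assumption: the entropy-reduction criterion, the discrete "acoustic" label standing in for a quantized acoustic observation, the field names, the `min_gain` threshold, and the toy data.

```python
from collections import Counter
from math import log2

# Hypothetical feature names mirroring the information listed in the abstract.
FEATURES = ["phone", "left", "right", "stress", "function_word"]

def entropy(labels):
    """Shannon entropy (bits) of a list of discrete labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())

def best_question(tokens, features):
    """Return (gain, feature, value) for the yes/no question 'feature == value'
    that most reduces the entropy of the acoustic label, or None if no
    question separates the tokens."""
    base = entropy([t["acoustic"] for t in tokens])
    best = None
    for f in features:
        for v in sorted({t[f] for t in tokens}, key=str):
            yes = [t["acoustic"] for t in tokens if t[f] == v]
            no = [t["acoustic"] for t in tokens if t[f] != v]
            if not yes or not no:
                continue  # the question does not split the data
            after = (len(yes) * entropy(yes) + len(no) * entropy(no)) / len(tokens)
            gain = base - after
            if best is None or gain > best[0]:
                best = (gain, f, v)
    return best

def grow(tokens, features, min_gain=0.1):
    """Recursively split the tokens; each returned leaf (a list of tokens)
    plays the role of one phone-like unit."""
    q = best_question(tokens, features)
    if q is None or q[0] < min_gain:
        return [tokens]
    _, f, v = q
    yes = [t for t in tokens if t[f] == v]
    no = [t for t in tokens if t[f] != v]
    return grow(yes, features, min_gain) + grow(no, features, min_gain)

# Toy, made-up tokens of the phone "t"; "acoustic" is an assumed quantized class.
TOKENS = [
    {"phone": "t", "left": "s",  "right": "aa", "stress": 1, "function_word": False, "acoustic": "A"},
    {"phone": "t", "left": "s",  "right": "iy", "stress": 1, "function_word": False, "acoustic": "A"},
    {"phone": "t", "left": "ih", "right": "ax", "stress": 0, "function_word": False, "acoustic": "B"},
    {"phone": "t", "left": "ae", "right": "ax", "stress": 0, "function_word": False, "acoustic": "B"},
    {"phone": "t", "left": "ax", "right": "uw", "stress": 0, "function_word": True,  "acoustic": "B"},
    {"phone": "t", "left": "n",  "right": "ah", "stress": 1, "function_word": False, "acoustic": "A"},
]

if __name__ == "__main__":
    for i, leaf in enumerate(grow(TOKENS, FEATURES)):
        print(f"PLU {i}: {len(leaf)} tokens")
```

In this toy data the lexical stress question separates the two acoustic classes cleanly, so the tree stops after one split. A real system would use acoustic likelihoods rather than a single discrete label, and the paper's actual procedure may differ.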
Similar resources
Effects of allophones on the performance of Korean speech recognition
This paper investigates the effects of allophones on the performance of Korean speech recognition systems. Along with a baseline phone-like unit (PLU) set consisting of phonemes, 31 allophone-based PLU sets are designed by systematically considering 5 major Korean allophonic constraints which can describe all the PLU sets currently used for Korean speech recognition systems. Experiments for pho...
Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods
Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...
Improving the Arabic Pronunciation Dictionary for Phone and Word Recognition with Linguistically-Based Pronunciation Rules
In this paper, we show that linguistically motivated pronunciation rules can improve phone and word recognition results for Modern Standard Arabic (MSA). Using these rules and the MADA morphological analysis and disambiguation tool, multiple pronunciations per word are automatically generated to build two pronunciation dictionaries; one for training and another for decoding. We demonstrate that...
Communication Behaviour of Farmers with the Agricultural Extension Agents Using Cell Phone: A Case of Bangladesh
The cell phone is one of the potential Information Communication Technologies (ICTs) in agricultural development especially in developing countries like Bangladesh. Thus, this paper deals with the farmers’ communication with the agricultural extension agents using mobile phone. The study was conducted in Mymensingh District in Bangladesh. Data were collected from a sample of 110 farmers while b...
Incorporating information from syllable-length time scales into automatic speech recognition
Including information distributed over intervals of syllabic duration (100–250 ms) may greatly improve the performance of automatic speech recognition (ASR) systems. ASR systems primarily use representations and recognition units covering phonetic durations (40–100 ms). Humans certainly use information at phonetic time scales, but results from psychoacoustics and psycholinguistics highlight the...